# Low-latency voice processing
Ultravox V0 4
MIT
Ultravox is a multimodal voice large language model based on Llama3.1-8B-Instruct and Whisper-medium, capable of processing both voice and text inputs simultaneously.
Audio-to-Text
Transformers Supports Multiple Languages

U
fixie-ai
1,851
48
Postmalone
A model for real-time voice conversion, capable of high-quality voice style transformation
Speech Synthesis
Transformers

P
sail-rvc
1,679
1
Featured Recommended AI Models